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4 <110> APPLICANT: Skolnick, Jeffrey 

5 Fetrow, Jacquelyn S. 

7 <120> TITLE OF INVENTION: METHODS AND SYSTEM FOR PREDICTING 

8 PROTEIN FUNCTION 
10 <130> FILE REFERENCE: 10886-047001 

12 <140> CURRENT APPLICATION NUMBER: 09/322, 067A 

13 <141> CURRENT FILING DATE: 1999-05-27 

15 <150> PRIOR APPLICATION NUMBER: 60/099,300 C M T P" H 

IM I t R F D 



16 <151> PRIOR FILING DATE: 1998-08-25 CI | \J J £ H F" |) 

18 <150> PRIOR APPLICATION NUMBER: 60/120,311 " ^ 

19 <151> PRIOR FILING DATE: 1999-02-16 
21 <160> NUMBER OF SEQ ID NOS : 12 

23 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 130 

27 <212> TYPE: PRT 

28 <213> ORGANISM: Haemophilus influenzae 

30 <400> SEQUENCE: 1 

31 Met Met Tyr Ser Lys Leu Leu Thr Leu Thr Thr Leu Leu Leu Pro Thr 

32 1 5 10 15 

33 Ala Leu Ala Leu Pro Ser Leu Val Glu Arg Ala Cys Asp Tyr Thr Cys 

34 20 25 30 

35 Gly Ser Asn Cys Tyr Ser Ser Ser Asp Val Ser Thr Ala Gin Ala Ala 

36 35 40 45 

37 Gly Tyr Gin Leu His Glu Asp Gly Glu Thr Val Gly Ser Asn Ser Tyr 

38 50 55 60 

39 Pro His Lys Tyr Asn Asn Tyr Glu Gly Phe Asp Phe Ser Val Ser Ser 

40 65 70 75 80 

41 Pro Tyr Tyr Glu Trp Pro lie Leu Ser Ser Gly Asp Val Tyr Ser Gly 

42 85 90 95 

43 Gly Ser Pro Gly Ala Asp Arg Val Val Phe Asn Glu Asn Asn Gin Leu 

44 100 105 110 

45 Ala Gly Val He Thr His Thr Gly Ala Ser Gly Asn Asn Phe Val Glu 

46 * 115 120 125 

47 Cys Thr 

48 130 

50 <210> SEQ ID NO: 2 

51 <211> LENGTH: 107 

52 <212> TYPE: PRT 

53 <213> ORGANISM: Haemophilus influenzae 

55 <400> SEQUENCE: 2 

56 Gin Ser Ala Thr Thr Cys Gly Ser Thr Asn Tyr Ser Ala Ser Gin Val 

57 1 5 10 15 

58 Arg Ala Ala Ala Asn Ala Ala Cys Gin Tyr Tyr Gin Asn Asp Asp Ser 

59 20 25 30 

60 Ala Gly Ser Thr Thr Tyr Pro His Thr Tyr Asn Asn Tyr Glu Gly Phe 

61 35 40 45 
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92 <210> SEQ ID NO: 4 

93 <211> LENGTH: 113 

94 <212> TYPE: PRT 

95 <213> ORGANISM: Haemophilus influenzae 

97 <4 00> SEQUENCE: 4 

98 Cys Asn lie Pro Glu Ser Thr Asn Cys Gly Gly Asn Val Tyr Ser Asn 
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115 <210> SEQ ID NO: 5 

116 <211> LENGTH: 104 
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117 <212> TYPE: PRT 

118 <213> ORGANISM: Haemophilus influenzae 

120 <400> SEQUENCE: 5 

121 Asp Cys Asp Tyr Thr Cys Gly Ser His Cys Tyr Ser Ala Ser Ala Val 

122 15 10 15 

123 Ser Asp Ala Gin Ser Ala Gly Tyr Gin Leu Glu Ser Ala Gly Gin Ser 

124 20 25 30 

125 Val Gly Arg Ser Arg Tyr Pro His Gin Tyr Arg Asn Tyr Glu Gly Phe 

126 35 40 45 

127 Asn Phe Pro Val Ser Gly Asn Tyr Tyr Glu Trp Pro He Leu Ser Ser 

128 50 55 60 

129 Gly Ser Thr Tyr Asn Gly Gly Gly Pro Gly Ala Asp Arg Val Val Phe 

130 65 70 75 80 

131 Asn Asp Asn Asp Glu Leu Ala Gly Leu He Thr His Thr Gly Ala Ser 

132 85 90 95 

133 Gly Asp Gly Phe Val Ala Cys Tyr 

134 100 

136 <210> SEQ ID NO: 6 

137 <211> LENGTH: 102 

138 <212> TYPE: PRT 

139 <213> ORGANISM: Haemophilus influenzae 

141 <400> SEQUENCE: 6 

142 Ala Cys Ala Ala Thr Cys Gly Thr Val Cys Tyr Thr Ser Ser Ala He 

143 15 10 15 

144 Ser Ser Ala Gin Ala Ala Gly Tyr Asn Leu Tyr Ser Thr Asn Asp Asp 

145 20 25 30 

146 Val Ser Asn Tyr Pro His Glu Tyr His Asn Tyr Glu Gly Phe Asp Phe 

147 35 40 45 

148 Pro Val Ser Gly Thr Tyr Tyr Glu Phe Pro He Leu Lys Ser Gly Lys 

149 50 55 60 

150 Val Tyr Thr Gly Ser Ser Pro Gly Ala Asp Arg Val He Phe Asn Asp 

151 65 70 75 80 

152 Asp Asp Glu Leu Ala Gly Val He Thr His Thr Gly Ala Ser Gly Asn 

153 85 90 95 

154 Asn Phe Val Ala Cys Thr 

155 100 

157 <210> SEQ ID NO: 7 

158 <211> LENGTH: 102 

159 <212> TYPE: PRT 

160 <213> ORGANISM: Haemophilus influenzae 

162 <400> SEQUENCE: 7 

163 Ala Cys Ala Ala Thr Cys Gly Ser Val Cys Tyr Thr Ser Ser Ala He 

164 15 10 15 

165 Ser Ala Ala Gin Glu Ala Gly Tyr Asp Leu Tyr Ser Ala Asn Asp Asp 

166 20 25 30 

167 Val Ser Asn Tyr Pro His Glu Tyr Arg Asn Tyr Glu Gly Phe Asp Phe 

168 35 40 45 

169 Pro Val Ser Gly Thr Tyr Tyr Glu Phe Pro He Leu Arg Ser Gly Ala 

170 50 55 60 
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171 Val Tyr Ser Gly Asn Ser Pro Gly Ala Asp Arg Val Val Phe Asn Gly 

172 65 70 75 80 

173 Asn Asp Gin Leu Ala Gly Val He Thr His Thr Gly Ala Ser Gly Asn 

174 85 90 95 

175 Asn Phe Val Ala Cys Asp 

176 100 

178 <210> SEQ ID NO: 8 

179 <211> LENGTH: 104 

180 <212> TYPE: PRT 

181 <213> ORGANISM: Haemophilus influenzae 

183 <400> SEQUENCE: 8 

184 Ala Cys Met Tyr He Cys Gly Ser Val Cys Tyr Ser Ser Ser Ala He 

185 15 10 15 

186 Ser Ala Ala Leu Asn Lys Gly Tyr Ser Tyr Tyr Glu Asp Gly Ala Thr 

187 20 25 30 

188 Ala Gly Ser Ser Ser Tyr Pro His Arg Tyr Asn Asn Tyr Glu Gly Phe 

189 35 40 45 

190 Asp Phe Pro Thr Ala Lys Pro Trp Tyr Glu Phe Pro He Leu Ser Ser 

191 50 55 60 

192 Gly Arg Val Tyr Thr Gly Gly Ser Pro Gly Ala Asp Arg Val He Phe 

193 65 70 75 80 

194 Asp Ser His Gly Asn Leu Asp Met Leu He Thr His Asn Gly Ala Ser 

195 85 90 95 

196 Gly Asn Asn Phe Val Ala Cys Asn 

197 100 

199 <210> SEQ ID NO: 9 

200 <211> LENGTH: 105 

201 <212> TYPE: PRT 

202 <213> ORGANISM: Haemophilus influenzae 

204 <400> SEQUENCE: 9 

205 Gin Gly Gly Val Ser Val Asn Cys Gly Gly Thr Tyr Tyr Ser Ser Thr 

206 15 10 15 

207 Gin Val Asn Arg Ala He Asn Asn Ala Lys Ser Gly Gin Tyr Ser Ser 

208 20 25 30 

209 Thr Gly Tyr Pro His Thr Tyr Asn Asn Tyr Glu Gly Phe Asp Phe Ser 

210 35 40 45 

211 Asp Tyr Cys Asp Gly Pro Tyr Lys Glu Tyr Pro Leu Lys Thr Ser Ser 

212 50 55 60 

213 Ser Gly Tyr Thr Gly Gly Ser Pro Gly Ala Asp Arg Val Val Tyr Asp 

214 65 70 75 80 

215 Ser Asn Asp Gly Thr Phe Cys Gly Ala He Thr His Thr Gly Ala Ser 

216 85 90 95 

217 Gly Asn Asn Phe Val Gin Cys Ser Tyr 

218 100 105 

220 <210> SEQ ID NO: 10 

221 <211> LENGTH: 177 

222 <212> TYPE: PRT 

223 <213> ORGANISM: Haemophilus influenzae 
225 <400> SEQUENCE: 10 
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